Topology-Aware Communication in Wide-Area Message-Passing
نویسنده
چکیده
This position paper examines the use of topology-aware communication services to support message-passing in wide-area, distributed environments, i.e., grids. Grid computing promises great benefits in the flexible sharing of resources but poses equally great challenges for highperformance computing, that is to say, how to execute large-scale computations on a grid with reasonable utilization of the machines involved. For wide-area computations using a message-passing paradigm, these issues can be addressed by using topology-aware communication, i.e., communication services that are aware of and can exploit the topology of the network connecting all relevant machines. Such services can include augmented communication semantics (e.g., filtering), collective operations, content-based and policy-based routing, and managing communication scope to manage feasibility. While such services can be implemented and deployed in a variety of ways, we propose the use of a peer-to-peer, middleware forwarding and routing layer. In a related application domain (time management in distributed simulations) we provide emulation results showing that such topology-awareness can play a major role in performance and scalability. Besides these benefits, such communication services raise a host of implementation and integration issues for their operational deployment and use in grid environments. Hence, we discuss the need for proper APIs and high-level models.
منابع مشابه
Design and Implementation of Adaptive Message Passing Systems for Wide-Area Distributed Computing Environments
Recently, much research has gone into wide-area message passing systems, but more work is necessary so that message passing systems can adapt to wide-area environments by themselves and stop requiring manual configuration. Thus, in this paper, I make two proposals concerning the design and implementation of adaptive message passing systems for wide-area, distributed environments. My first propo...
متن کاملCollective Operations for Wide-Area Message Passing Systems Using Dynamically Created Spanning Trees
We propose a configuration-free method to perform collective operations efficiently in dynamically changing topologies. Our collective operations are designed so that (1) they perform well when the topology is stable, (2) they complete successfully even when processors join or leave, and (3) they adapt to topology changes. We propose to create adaptive latency-aware spanning trees for short mes...
متن کاملDynamic Topology Selection for High Performance MPI in the Grid Environments
MPI (Message Passing Interface) is getting more popular and important even in the Grid, but its performance still remains a problem, which is caused by the communication bottleneck on wide area links. To overcome such performance wall problem, we propose a dynamic topology selection which is a kind of resource selection method. It provides an effective resource selection service based on four p...
متن کاملCollective operations for wide-area message passing systems using adaptive spanning trees
We propose a method for wide-area message-passing systems to perform broadcasts and reductions efficiently using latency and bandwidth-aware spanning trees constructed at run-time. These trees are updated when processes join or leave a computation, allowing effective execution to continue. We have implemented our proposal on the Phoenix Message-Passing Library and performed experiments using 16...
متن کاملAn Efficient Group Communication Architecture over ATM Networks
NYNET (ATM wide-area network testbed in New York state) Communication System (NCS) is a multithreaded message-passing tool developed at Syracuse University that provides low-latency and high-throughput communication services over Asynchronous Transfer Mode (ATM)-based highperformance distributed computing (HPDC) environments. NCS provides exible and scalable group communication services based o...
متن کامل